Some elementary tests for mixtures of discrete distributions

by 

J.  Tiago  de  Oliveira 

The problems dealt with in this paper arose in the
following context.

It is known, in some paleontological problems, that the
distribution of some countable (meristic) characteristics
of neighboring biological species, isolated in well-defined
geographical areas, has some fixed (or stable) form.

In some cases, however, the analysis of the frequency
polygon seemed to suggest the existence of a mixture of two
populations, and the test in Section 3 was devised for the purpose
of deciding about the hypothesis of the mixture.

Afterwards, some of the results were extended.

1. Introduction

Let $P = \{p_i\}$ and $Q = \{q_i\}$ be discrete distributions with
the same discrete support $\Omega = \{x_i\}$, which is a subset of the
real line, and let $X$ be a random variable whose (discrete)
distribution is the mixture $(\omega, 1-\omega)$ $(0 \le \omega \le 1)$ of the
distributions $P$ and $Q$, called the components; some of the
values $p_i$ or $q_i$ may be zero.

As it is very difficult to work out explicitly the best
tests (and estimators) for the existence of the mixture (that
is, the hypothesis that $\omega \ne 0, 1$), we will develop some (quick)
tests with a lower efficiency (not evaluated). The tests we
will study are based on the first moments of some random
variable dependent on the random variable $X$, as is usual
(Rider, 1961). The technique followed is connected with some
previous results of Tiago de Oliveira (1960).

Let, then, $g(x)$ be some function whose domain of definition
contains the support $\Omega$ and such that it has mean values and
variances relative to $P$ and $Q$, denoted respectively by $\mu_P$,
$\mu_Q$, $\sigma_P^2$ and $\sigma_Q^2$. The new random variable $Y = g(X)$ has, then,
the following mean and variance:

$$M(Y) = \omega \mu_P + (1-\omega) \mu_Q$$

$$V(Y) = \omega \sigma_P^2 + (1-\omega) \sigma_Q^2 + 2\omega(1-\omega)(\mu_P - \mu_Q)^2$$
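As a numerical illustration, the following Python sketch computes the mean and variance of $Y = g(X)$ directly from the mixed probabilities $\omega p_i + (1-\omega) q_i$, without using a closed-form expression. The function name `mixture_moments` and the example components are hypothetical and are not taken from the paper.

```python
from typing import Callable, Sequence, Tuple


def mixture_moments(
    xs: Sequence[float],
    p: Sequence[float],
    q: Sequence[float],
    omega: float,
    g: Callable[[float], float],
) -> Tuple[float, float]:
    """Mean and variance of Y = g(X) when X follows the mixture (omega, 1 - omega)."""
    # Probability of x_i under the mixture: omega * p_i + (1 - omega) * q_i.
    mix = [omega * pi + (1.0 - omega) * qi for pi, qi in zip(p, q)]
    ys = [g(x) for x in xs]
    mean = sum(m * y for m, y in zip(mix, ys))
    second = sum(m * y * y for m, y in zip(mix, ys))
    return mean, second - mean * mean


# Hypothetical components on the support {0, 1, 2} (illustration only).
xs = [0, 1, 2]
p = [0.5, 0.3, 0.2]
q = [0.1, 0.3, 0.6]
mean_y, var_y = mixture_moments(xs, p, q, omega=0.4, g=lambda x: x)
```

Setting `omega` to 0 or 1 returns the moments of $Q$ or $P$ alone, which gives a quick check of the inputs.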

As is well known (Cramér, 1946), if $\mu_k$ is the $k$-th moment
of $Y$ and

$$M_k^{(n)} = \frac{1}{n} \sum_{i=1}^{n} y_i^k$$

is the $k$-th sample moment, then $\sqrt{n}\,(M_k^{(n)} - \mu_k)$ is a random
variable with an asymptotically normal distribution with zero mean
and variance $\mu_{2k} - \mu_k^2$, and the random pair
$\left(\sqrt{n}\,(M_k^{(n)} - \mu_k),\ \sqrt{n}\,(M_l^{(n)} - \mu_l)\right)$
is asymptotically normally distributed with zero means,
variances $\mu_{2k} - \mu_k^2$ and $\mu_{2l} - \mu_l^2$, and covariance $\mu_{k+l} - \mu_k \mu_l$.
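The limiting variances and covariance quoted above are simple functions of the moments of $Y$. A minimal Python sketch, assuming $Y$ takes finitely many values with known probabilities (the helper names `raw_moment` and `asymptotic_cov` are illustrative only):

```python
from typing import Sequence


def raw_moment(ys: Sequence[float], probs: Sequence[float], k: int) -> float:
    """k-th moment mu_k = sum_i probs_i * y_i**k of a discrete variable Y."""
    return sum(pr * y ** k for pr, y in zip(probs, ys))


def asymptotic_cov(ys: Sequence[float], probs: Sequence[float], k: int, l: int) -> float:
    """Limiting covariance mu_{k+l} - mu_k * mu_l of the normalised sample moments.

    With k == l this is the limiting variance mu_{2k} - mu_k**2.
    """
    return raw_moment(ys, probs, k + l) - raw_moment(ys, probs, k) * raw_moment(ys, probs, l)


# Hypothetical example: Y equal to 0 or 1 with probability 1/2 each.
var_first = asymptotic_cov([0, 1], [0.5, 0.5], 1, 1)
cov_first_second = asymptotic_cov([0, 1], [0.5, 0.5], 1, 2)
```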

2. Components fully known: Let us suppose that the components
$P$ and $Q$ are fully known and finite. The fact that

$$\sqrt{n}\,\frac{M_1^{(n)} - (\omega \mu_P + (1-\omega)\mu_Q)}{\sqrt{\omega \sigma_P^2 + (1-\omega)\sigma_Q^2 + 2\omega(1-\omega)(\mu_P - \mu_Q)^2}}$$

is an asymptotically normal random variable with zero mean
and unit variance suggests the use of

$$\omega_n^* = \frac{M_1^{(n)} - \mu_Q}{\mu_P - \mu_Q}$$

as an estimator of $\omega$, because the values $\mu_P$, $\mu_Q$, $\sigma_P^2$ and $\sigma_Q^2$
are known as a consequence of the full knowledge of $P$ and $Q$.
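A moment estimator of this form, the first sample moment rescaled by the known component means, can be sketched in Python as follows. The function name `estimate_omega` is hypothetical; the sketch assumes the observations have already been transformed to $y = g(x)$ and that $\mu_P \ne \mu_Q$.

```python
from typing import Sequence


def estimate_omega(sample_y: Sequence[float], mu_p: float, mu_q: float) -> float:
    """Moment estimator (M_1 - mu_Q) / (mu_P - mu_Q) of the mixing weight omega."""
    if mu_p == mu_q:
        raise ValueError("mu_P and mu_Q must differ for omega to be identifiable")
    m1 = sum(sample_y) / len(sample_y)  # first sample moment
    # The value is not clipped: in finite samples it may fall outside [0, 1].
    return (m1 - mu_q) / (mu_p - mu_q)


# Hypothetical data with sample mean 1.18, and component means 0.7 and 1.5.
omega_hat = estimate_omega([1.0, 1.36], mu_p=0.7, mu_q=1.5)
```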

The asymptotic variance of $\omega_n^*$ is, then,

$$\frac{1}{n}\left[\frac{\omega \sigma_P^2 + (1-\omega)\sigma_Q^2}{(\mu_P - \mu_Q)^2} + 2\omega(1-\omega)\right]$$

and, consequently, to obtain the maximum discrimination we
would choose the function $g(x)$ such that the values

$$y_i = g(x_i)$$

would lead to a minimum variance whatever may be the value of
$\omega$. This suggests the use of a function $g$ such that
$\sigma_P^2 = \sigma_Q^2$ and $(\mu_P - \mu_Q)^2$ is a maximum.

As we can make a linear transformation of the values $y_i$
without altering the result, which is invariant under these
transformations, we can then impose

$$\sigma_P^2 = 1, \qquad \sigma_Q^2 = 1, \qquad \mu_Q = 0$$

as condition equations and then search for the values $y_i$ which
maximise $\mu_P^2$.

The technique of the Lagrange multipliers leads to the
solution

$$y_i = \frac{(1 + \alpha\gamma)\,p_i - q_i}{\beta q_i + \gamma p_i}$$

where $\alpha$, $\beta$, $\gamma$ are determined by the condition equations

$$\sum p_i y_i^2 - \left(\sum p_i y_i\right)^2 = 1$$

$$\sum q_i y_i = 0$$

$$\sum q_i y_i^2 = 1$$

It is easy to see that $\sum p_i y_i = \alpha$ and $\alpha = \beta + \gamma$. As $\alpha$ will be
determined from the condition equations except for the sign,
we can always choose $\alpha$ such that $\mu_P = \sum p_i y_i > 0$.
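For readers who wish to check a solution of this kind, the following is a sketch of the Lagrange computation. It assumes that the quantity maximised is $\sum p_i y_i$ and that multipliers $\gamma/2$, $\delta$ and $\beta/2$ are attached to the three condition equations; the multiplier $\delta$ and this normalisation are introduced here for illustration and are not taken from the original text.

```latex
% Stationarity in y_i of
%   \sum p_i y_i - (\gamma/2) [ \sum p_i y_i^2 - (\sum p_i y_i)^2 - 1 ]
%                - \delta \sum q_i y_i - (\beta/2) [ \sum q_i y_i^2 - 1 ],
% writing \alpha = \sum p_i y_i and using the three condition equations.
\begin{align*}
p_i - \gamma p_i (y_i - \alpha) - \delta q_i - \beta q_i y_i &= 0 \\
(\beta q_i + \gamma p_i)\, y_i &= (1 + \alpha\gamma)\, p_i - \delta q_i \\
\text{summing over } i: \quad \gamma\alpha &= 1 + \alpha\gamma - \delta
  \;\Longrightarrow\; \delta = 1 \\
\text{multiplying by } y_i \text{ and summing:} \quad
\beta + \gamma (1 + \alpha^2) &= (1 + \alpha\gamma)\,\alpha
  \;\Longrightarrow\; \alpha = \beta + \gamma
\end{align*}
```

The third line uses $\sum q_i y_i = 0$, and the fourth uses $\sum q_i y_i^2 = 1$ together with $\sum p_i y_i^2 = 1 + \alpha^2$.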

In the greater part of the problems the effective
determination of the $y_i$ by the solution of the three (condition)
equations in $\alpha$, $\beta$, $\gamma$ is a difficult one. In these cases
we will only choose the $y_i$ such that

$$\sigma_P^2 = \sigma_Q^2 = 1 \quad \text{and} \quad \mu_Q = 0,$$

which is always possible.

Once the values of $y_i$ have been determined or chosen, we can proceed
to the test. Since, for $\omega = 0$ or $\omega = 1$, we know that $\sqrt{n}\,M_1^{(n)}$ or
$\sqrt{n}\,(M_1^{(n)} - \mu_P)$ are asymptotically normal with zero mean and unit
variance, the values $M_1^{(n)}$ tend to concentrate around $\mu_Q = 0$ or
$\mu_P > 0$ according to $\omega = 0$ or $\omega = 1$. Let us then fix an asymptotic
level of significance $\epsilon$ and take $\lambda_\epsilon$ such that

$$\int_{-\infty}^{\lambda_\epsilon} \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}\, dx = 1 - \epsilon$$

and act as follows:

1) if $M_1^{(n)} \le \lambda_\epsilon/\sqrt{n}$ we will accept the hypothesis $\omega = 0$;

2) if $M_1^{(n)} \ge \mu_P - \lambda_\epsilon/\sqrt{n}$ we will accept the hypothesis $\omega = 1$;

3) if $\lambda_\epsilon/\sqrt{n} < M_1^{(n)} < \mu_P - \lambda_\epsilon/\sqrt{n}$ we will accept the hypothesis
of the existence of a mixture; steps 1) and 2) lead to the
rejection of this hypothesis.
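The three-way decision rule can be sketched in Python as below. This is an illustration under stated assumptions, not a definitive implementation: the observations are assumed to be already transformed so that $\sigma_P^2 = \sigma_Q^2 = 1$ and $\mu_Q = 0$, the function name `three_way_decision` is hypothetical, and the normal quantile is taken from the standard library's `statistics.NormalDist`.

```python
import math
from statistics import NormalDist
from typing import Sequence


def three_way_decision(sample_y: Sequence[float], mu_p: float, eps: float) -> str:
    """Apply rules 1), 2), 3) to the first sample moment of the transformed data."""
    n = len(sample_y)
    m1 = sum(sample_y) / n
    lam = NormalDist().inv_cdf(1.0 - eps)  # upper quantile lambda_eps
    margin = lam / math.sqrt(n)
    # For small n the regions of rules 1) and 2) can overlap; checking rule 1)
    # first is a choice made in this sketch, not something stated in the text.
    if m1 <= margin:
        return "omega = 0"
    if m1 >= mu_p - margin:
        return "omega = 1"
    return "mixture"


# Hypothetical transformed samples of size 100 with mu_P = 2.
near_q = three_way_decision([0.0] * 100, mu_p=2.0, eps=0.05)
near_p = three_way_decision([2.0] * 100, mu_p=2.0, eps=0.05)
between = three_way_decision([1.0] * 100, mu_p=2.0, eps=0.05)
```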

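Let us denote by $P_1^{(n)}(\omega)$, $P_2^{(n)}(\omega)$ and $P_3^{(n)}(\omega)$ the
probabilities of the decisions 1), 2) and 3) if the mixture is
$(\omega, 1-\omega)$. We have

$$P_1^{(n)}(\omega) = \text{Prob}\left\{ \sqrt{n}\,\frac{M_1^{(n)} - \omega\mu_P}{\sqrt{1 + 2\omega(1-\omega)\mu_P^2}} \le \frac{\lambda_\epsilon - \omega\mu_P\sqrt{n}}{\sqrt{1 + 2\omega(1-\omega)\mu_P^2}} \right\}$$

$$P_2^{(n)}(\omega) = \text{Prob}\left\{ \sqrt{n}\,\frac{M_1^{(n)} - \omega\mu_P}{\sqrt{1 + 2\omega(1-\omega)\mu_P^2}} \ge \frac{(1-\omega)\mu_P\sqrt{n} - \lambda_\epsilon}{\sqrt{1 + 2\omega(1-\omega)\mu_P^2}} \right\}$$

and $P_3^{(n)}(\omega) = 1 - P_1^{(n)}(\omega) - P_2^{(n)}(\omega)$, and passing to the
limit, we obtain

$$\lim_{n\to\infty} P_1^{(n)}(0) = 1 - \epsilon, \qquad \lim_{n\to\infty} P_1^{(n)}(\omega) = 0 \ \text{ if } \omega \ne 0;$$

$$\lim_{n\to\infty} P_2^{(n)}(1) = 1 - \epsilon, \qquad \lim_{n\to\infty} P_2^{(n)}(\omega) = 0 \ \text{ if } \omega \ne 1;$$

$$\lim_{n\to\infty} P_3^{(n)}(0) = \lim_{n\to\infty} P_3^{(n)}(1) = \epsilon, \qquad \lim_{n\to\infty} P_3^{(n)}(\omega) = 1 \ \text{ if } \omega \ne 0, 1.$$

The test leading to the solution of the trilemma is then a
consistent one with the asymptotic level $\epsilon$ for the hypotheses
$\omega = 0$ and $\omega = 1$.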
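To see how quickly these limits are approached, the decision probabilities can be evaluated under the normal approximation. In the sketch below the function name `decision_probabilities` is hypothetical, and the variance of $Y$ under the mixture is passed in by the caller as `var_y`, so the code does not commit to any particular closed form for it.

```python
import math
from statistics import NormalDist
from typing import Tuple


def decision_probabilities(
    omega: float, mu_p: float, var_y: float, n: int, eps: float
) -> Tuple[float, float, float]:
    """Normal-approximation probabilities of decisions 1), 2), 3).

    Assumes mu_Q = 0 and that sqrt(n) * (M_1 - omega * mu_P) is approximately
    normal with zero mean and variance var_y.
    """
    std = NormalDist()
    lam = std.inv_cdf(1.0 - eps)
    shift = mu_p * math.sqrt(n)
    if shift <= 2.0 * lam:
        # The acceptance regions of rules 1) and 2) would overlap.
        raise ValueError("n is too small for the three regions to be disjoint")
    sd = math.sqrt(var_y)
    p1 = std.cdf((lam - omega * shift) / sd)
    p2 = 1.0 - std.cdf(((1.0 - omega) * shift - lam) / sd)
    return p1, p2, 1.0 - p1 - p2


# Hypothetical settings: mu_P = 1, n = 400, eps = 0.05.
pure_q = decision_probabilities(0.0, 1.0, 1.0, 400, 0.05)
mixed = decision_probabilities(0.5, 1.0, 1.5, 400, 0.05)
```

With these inputs the pure case $\omega = 0$ gives a first probability close to $1 - \epsilon$, and the mixed case puts almost all the mass on decision 3).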

It is evident that the same technique may be applied
even if we do not have random variables but only random attributes,
giving to each attribute, whose pair of probabilities is
$(p_i, q_i)$, the value $y_i = g(x_i)$.


Implicitly we have shown that

$$\omega_n^* = \frac{M_1^{(n)}}{\mu_P}$$

is an estimator of $\omega$. Other estimators may also be obtained.
The maximum likelihood and minimum $\chi^2$ methods are unworkable.
The minimum $\chi$ ... to $\omega_n$; its mean value is asymptotically $\omega$
and its variance is a cumbersome one.

3. The translation mixture case: We shall suppose, now, that
the discrete components (finite or enumerable) $P$ and $Q$ have
probabilities of the form

$$p_i = \varphi(i - \theta_P) \quad \text{and} \quad q_i = \varphi(i - \theta_Q)$$

where the outcomes are equally spaced, so that for simplicity
we suppose $\Omega$ to be the set of the integers; $\varphi(n)$ is a function
such that

$$\sum \varphi(n) = 1 \quad \text{and} \quad \varphi(n) \ge 0$$

and we will suppose, also, that

$$\mu = \sum n\,\varphi(n) \quad \text{and} \quad \sigma^2 = \sum n^2 \varphi(n) - \mu^2$$

exist (and are known). In the sequel we will need also the
existence of the 4th moment. Then we have

$$M(X) = \mu + \omega\theta_P + (1-\omega)\theta_Q$$

$$V(X) = \sigma^2 + 2\omega(1-\omega)(\theta_P - \theta_Q)^2.$$
As the variance increases with the mixture we will use as a
test statistic the variance of the sample, $S_n^2$; the test will
be one-sided. As the asymptotic distribution of

$$\sqrt{n}\,\frac{S_n^2 - V(X)}{\sqrt{\beta_2 + 2}\; V(X)}$$

is a normal one with zero mean and unit variance ($\beta_2 = \mu_4/\mu_2^2 - 3$
denoting the kurtosis or excess coefficient, with $\mu_2$ and $\mu_4$ here
the second and fourth central moments of $X$), we will reject
the hypothesis of mixture, at the asymptotic level of significance
$\epsilon$, if

$$\sqrt{n}\,\frac{S_n^2 - \sigma^2}{\sigma^2 \sqrt{\beta_2^0 + 2}} \le \lambda_\epsilon$$

and accept it if

$$\sqrt{n}\,\frac{S_n^2 - \sigma^2}{\sigma^2 \sqrt{\beta_2^0 + 2}} > \lambda_\epsilon,$$

$\beta_2^0$ denoting the kurtosis coefficient of $\varphi(n)$.
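The one-sided variance test can be sketched in Python as follows. The function name `translation_mixture_test` is hypothetical, and the sample variance is computed with divisor $n$, which is an assumption of this sketch (the choice of divisor does not affect the asymptotic result).

```python
import math
from statistics import NormalDist
from typing import Sequence


def translation_mixture_test(
    sample: Sequence[float], sigma2: float, beta2_0: float, eps: float
) -> bool:
    """Return True when the hypothesis of mixture is accepted, False when rejected.

    sigma2 and beta2_0 are the known variance and kurtosis coefficient of phi.
    """
    if sigma2 <= 0.0 or beta2_0 + 2.0 <= 0.0:
        raise ValueError("need sigma2 > 0 and beta2_0 > -2")
    n = len(sample)
    mean = sum(sample) / n
    s2 = sum((x - mean) ** 2 for x in sample) / n  # sample variance, divisor n
    lam = NormalDist().inv_cdf(1.0 - eps)
    z = math.sqrt(n) * (s2 - sigma2) / (sigma2 * math.sqrt(beta2_0 + 2.0))
    return z > lam


# Hypothetical sample with variance exactly 1.
data = [0, 2] * 50
same_spread = translation_mixture_test(data, sigma2=1.0, beta2_0=0.0, eps=0.05)
extra_spread = translation_mixture_test(data, sigma2=0.25, beta2_0=0.0, eps=0.05)
```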

The probability of rejection of the hypothesis of
mixture is then, for the mixture $(\omega, 1-\omega)$,

$$P_n(\omega) = \text{Prob}\left\{ S_n^2 \le \sigma^2 + \lambda_\epsilon\,\sigma^2 \sqrt{\frac{\beta_2^0 + 2}{n}} \right\};$$

for $\omega = 0$ or $\omega = 1$ we have

$$\lim_{n\to\infty} P_n(0) = \lim_{n\to\infty} P_n(1) = 1 - \epsilon$$

and if $\omega \ne 0, 1$ we obtain

$$\lim_{n\to\infty} P_n(\omega) = 0;$$

the test is then a consistent one.

In concrete cases, such as the biological problems of
meristic characteristics, we can use past experience to
obtain a "scheme" of the values of $\varphi(n)$ and then apply this
technique.

4. Poisson mixtures: Let now $\lambda_P$ and $\lambda_Q$ be the parameters of
two Poisson components of a mixture; $\Omega$ is the set of non-negative
integers. As $\mu_P = \sigma_P^2 = \lambda_P$ and $\mu_Q = \sigma_Q^2 = \lambda_Q$ we have

$$M(X) = \omega\lambda_P + (1-\omega)\lambda_Q$$

$$V(X) = \omega\lambda_P + (1-\omega)\lambda_Q + 2\omega(1-\omega)(\lambda_P - \lambda_Q)^2$$

We have then $V(X) = M(X)$ if $\omega = 0$ or $\omega = 1$ and $V(X) > M(X)$ if
$\omega \ne 0, 1$. This suggests, as a test statistic, the difference
between the sample variance and mean and, also, the use of
a one-sided test.

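Whatever may be $\omega$ we have

$$M(S_n^2 - M_n) = 2\omega(1-\omega)(\lambda_P - \lambda_Q)^2$$

$$V(S_n^2 - M_n) \sim \frac{b(\omega, \lambda_P, \lambda_Q)}{n};$$

for $\omega = 0$ or $\omega = 1$, the function $b(\omega, \lambda_P, \lambda_Q)$ reduces to
$b(\lambda)$ = 1 - 2 JK + 3X.

The asymptotic distribution of

$$\sqrt{n}\,\frac{S_n^2 - M_n - 2\omega(1-\omega)(\lambda_P - \lambda_Q)^2}{\sqrt{b(\omega, \lambda_P, \lambda_Q)}}$$

is normal with zero mean and unit variance.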

As, under the hypothesis $\omega = 0$ or $\omega = 1$, $b$ reduces to
$b(\lambda)$, and an estimator of $\lambda_P$ (or $\lambda_Q$) is the sample mean $M_n$,
$b(M_n)$ is an estimator of $b(\lambda)$ owing to the continuity of $b$
(Cramér, 1946), and the asymptotic distribution of

$$\sqrt{n}\,\frac{S_n^2 - M_n}{\sqrt{b(M_n)}}$$

is also normal with zero mean and unit variance.
The one-sided test is then the following:
reject the hypothesis of mixture if

$$S_n^2 - M_n \le \lambda_\epsilon \sqrt{\frac{b(M_n)}{n}}$$

and accept it if

$$S_n^2 - M_n > \lambda_\epsilon \sqrt{\frac{b(M_n)}{n}}.$$
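A Python sketch of this one-sided test is given below. Two points are assumptions of the sketch rather than statements from the text: the sample variance uses divisor $n$, and the default for `b` is $2\lambda^2$, the standard large-sample variance of $\sqrt{n}\,(S_n^2 - M_n)$ for a single Poisson distribution. A different function can be supplied through the `b` argument. The function name `poisson_mixture_test` is hypothetical.

```python
import math
from statistics import NormalDist
from typing import Callable, Sequence


def poisson_mixture_test(
    counts: Sequence[int],
    eps: float,
    b: Callable[[float], float] = lambda lam: 2.0 * lam * lam,  # assumed form
) -> bool:
    """Return True when the hypothesis of mixture is accepted, False when rejected."""
    n = len(counts)
    m = sum(counts) / n                          # sample mean M_n
    s2 = sum((c - m) ** 2 for c in counts) / n   # sample variance, divisor n
    lam_eps = NormalDist().inv_cdf(1.0 - eps)
    threshold = lam_eps * math.sqrt(b(m) / n)
    return (s2 - m) > threshold


# Hypothetical count data, both with sample mean 5.
overdispersed = poisson_mixture_test([0] * 50 + [10] * 50, eps=0.05)
underdispersed = poisson_mixture_test([4, 6] * 50, eps=0.05)
```

The first sample has variance 25 against mean 5 and is classified as a mixture; the second has variance 1 and is not.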

The probability of rejection of the hypothesis of mixture
is then

$$P_n(\omega) = \text{Prob}\left\{ S_n^2 - M_n \le \lambda_\epsilon \sqrt{\frac{b(M_n)}{n}} \right\}$$

and by the asymptotic normality referred to above we see that

$$\lim_{n\to\infty} P_n(0) = \lim_{n\to\infty} P_n(1) = 1 - \epsilon$$

and

$$\lim_{n\to\infty} P_n(\omega) = 0 \quad \text{if } \omega \ne 0, 1,$$

which shows the consistency of the test.